Make diffusion model conditioning more flexible #521


Merged
merged 24 commits into from
Jul 24, 2025

Conversation

arrjon
Member

@arrjon arrjon commented Jun 24, 2025

I introduced a new keyword, `concatenated_input`, to the `subnet_kwargs` in the diffusion model. This keyword controls how inputs—such as parameters, noise, and conditions—are fed into the model’s subnet.

Previously, the model assumed all inputs were 1D vectors and concatenated them directly for the default MLP subnet. However, for more flexible architectures—such as subnets designed to preserve or induce spatial structure—this assumption does not hold. Now we can also pass all inputs separately to the subnet.
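The two behaviors described above can be sketched as a small dispatch helper. This is a minimal illustration, not the BayesFlow implementation; `call_subnet` and the input names `xz`, `time`, and `conditions` are hypothetical:

```python
import numpy as np

def call_subnet(subnet, xz, time, conditions, concatenate_subnet_input=True):
    """Dispatch inputs to the subnet either concatenated or separately.

    Hypothetical helper mirroring the idea in this PR: `xz` stands for the
    (noisy) parameters, `time` for the diffusion time, and `conditions`
    for the conditioning variables.
    """
    if concatenate_subnet_input:
        # Default behavior: flatten everything into one vector input
        # along the feature axis, suitable for an MLP subnet.
        return subnet(np.concatenate([xz, time, conditions], axis=-1))
    # Flexible behavior: hand each input to the subnet separately, so the
    # subnet itself can decide how to combine them (e.g., to preserve
    # spatial structure in `conditions`).
    return subnet(xz=xz, time=time, conditions=conditions)
```

With a plain MLP-style subnet the first branch applies; a multi-input subnet receives named tensors via the second branch.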

@arrjon arrjon requested review from stefanradev93 and vpratz June 24, 2025 13:41
@arrjon arrjon self-assigned this Jun 24, 2025

codecov bot commented Jun 24, 2025

Codecov Report

All modified and coverable lines are covered by tests ✅

| Files with missing lines | Coverage Δ |
| --- | --- |
| ...w/networks/consistency_models/consistency_model.py | 97.79% <100.00%> (+0.13%) ⬆️ |
| ...esflow/networks/diffusion_model/diffusion_model.py | 79.79% <100.00%> (+0.64%) ⬆️ |
| bayesflow/networks/flow_matching/flow_matching.py | 93.44% <100.00%> (+0.58%) ⬆️ |

@vpratz
Collaborator

vpratz commented Jun 24, 2025

Thanks for the PR, I think this is a reasonable idea for advanced use cases. As this is another instance of multi-input networks (even though it's inside the inference network this time, and does not involve the adapter), we might want to include this in the discussion in #517. We might also think about:

  • How to pass inputs: named (as a dictionary) or by position (as a tuple, like you propose in the PR)
  • Consistency in our multi-step models: if we offer this possibility here, we might want to do the same in flow matching and consistency models.

Tagging @LarsKue for comment as well.

@stefanradev93
Contributor

stefanradev93 commented Jun 24, 2025

One very general approach would be to break free from the fixed names, such as "inference_variables", and actually allow for:

  • Marking different simulator outputs as either target variables, summary variables, or inference conditions
  • Selecting a strategy for how different outputs of the same type are handled (e.g., concatenated, passed as a tuple, or passed as keyword arguments)

This could be handled with another abstraction, such as a SimulatorOutput with a flexible scheme.
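The proposed abstraction could look roughly like the following sketch. Everything here is hypothetical: `SimulatorOutput`, `group_outputs`, the role names, and the strategy names are illustrations of the idea, not existing BayesFlow API:

```python
from dataclasses import dataclass

import numpy as np

@dataclass
class SimulatorOutput:
    """Hypothetical wrapper: each simulator output is tagged with a role
    instead of relying on fixed names like "inference_variables"."""
    role: str   # e.g. "target", "summary", or "condition"
    name: str   # variable name from the simulator
    data: object

def group_outputs(outputs, strategy="concatenate"):
    """Group outputs by role, then combine same-role outputs per strategy."""
    grouped = {}
    for out in outputs:
        grouped.setdefault(out.role, []).append(out)
    if strategy == "concatenate":
        # One array per role, concatenated along the feature axis.
        return {role: np.concatenate([o.data for o in outs], axis=-1)
                for role, outs in grouped.items()}
    if strategy == "kwargs":
        # One dict per role, keeping each output accessible by name.
        return {role: {o.name: o.data for o in outs}
                for role, outs in grouped.items()}
    raise ValueError(f"unknown strategy: {strategy}")
```

The strategy switch mirrors the choice discussed above between concatenation, tuples, and keyword arguments.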

@arrjon
Member Author

arrjon commented Jun 24, 2025

> Consistency in our multi-step models. If we offer this possibility here, we might want to do the same in flow matching and consistency models.

I agree @vpratz. I added the same logic to both models. I am open to suggestions for how the tensors should be passed to the subnet.

To @stefanradev93's comment: I think this flexibility is only needed for advanced users. So maybe we should not follow this general approach for now, as the fixed names help users to get started with BayesFlow.

@stefanradev93
Contributor

> Consistency in our multi-step models. If we offer this possibility here, we might want to do the same in flow matching and consistency models.
>
> I agree @vpratz. I added the same logic to both models. I am open to suggestions for how the tensors should be passed to the subnet.
>
> To @stefanradev93's comment: I think this flexibility is only needed for advanced users. So maybe we should not follow this general approach for now, as the fixed names help users to get started with BayesFlow.

Absolutely, this is definitely a 2.>1.x idea.

@arrjon
Member Author

arrjon commented Jul 8, 2025

Is it okay as it is for now, @stefanradev93?

@vpratz
Collaborator

vpratz commented Jul 8, 2025

I think we need to document somewhere how this can be used (i.e., which inputs are passed to the network if `concatenate_subnet_input` is `False`), as this currently only exists in the code itself. I'd suggest we pass the inputs by name, as this simplifies communication (only names, no order).

It would be good to have a test in place for the `concatenate_subnet_input=False` case.

@arrjon
Member Author

arrjon commented Jul 9, 2025

Thanks @vpratz for the suggestions. I added the documentation.

Regarding the test: As we do not have a network at the moment which can handle multiple inputs, I do not know a useful test for the `concatenate_subnet_input=False` case. Any suggestions?

@vpratz
Collaborator

vpratz commented Jul 11, 2025

@arrjon Thanks for the changes!
I would propose to add a simple wrapper network that can handle the case to the test suite, similar to other dummy networks we have. It could just take the separate inputs, concatenate them and pass them to any other network as usual. The main point for me is that we are able to notice if we accidentally break the functionality, so a basic dedicated test should be sufficient.
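A dummy network along these lines could be sketched as follows. The class name `ConcatenateWrapper` and its interface are illustrative assumptions, not the test code that was actually added:

```python
import numpy as np

class ConcatenateWrapper:
    """Minimal dummy multi-input network for testing: it accepts the
    separate named inputs, concatenates them along the feature axis,
    and forwards the result to any wrapped single-input network."""

    def __init__(self, inner):
        self.inner = inner  # any callable taking one concatenated array

    def __call__(self, **inputs):
        # Sort keys so the concatenation order is deterministic.
        x = np.concatenate([inputs[k] for k in sorted(inputs)], axis=-1)
        return self.inner(x)
```

Because the wrapper reduces the multi-input case to the single-input one, the existing networks and assertions can be reused, which matches the goal of simply catching regressions in the `concatenate_subnet_input=False` path.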

@arrjon
Member Author

arrjon commented Jul 17, 2025

@vpratz I added the test. Now the PR should be ready to merge! :)

@vpratz
Collaborator

vpratz commented Jul 22, 2025

Thanks a lot @arrjon. Sorry for being slow to review, and for always coming up with new things I didn't notice before.

The build functions do not take the `concatenate_subnet_input` parameter into account when building the subnet. I'm not sure when this is problematic and when it isn't, but I think it would be good to pass the correct shapes there as well, to avoid weird problems down the line.
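The shape logic in question can be illustrated with a small helper. This is a hypothetical sketch of the idea, with assumed names (`subnet_input_shape`, `xz_shape`, `time_shape`, `conditions_shape`), not the actual build code:

```python
def subnet_input_shape(xz_shape, time_shape, conditions_shape, concatenate=True):
    """Compute the shape(s) the subnet should be built with.

    When inputs are concatenated, the subnet sees a single input whose
    feature dimension is the sum of the parts; otherwise each input
    keeps its own shape and is passed by name.
    """
    if concatenate:
        features = xz_shape[-1] + time_shape[-1] + conditions_shape[-1]
        return xz_shape[:-1] + (features,)
    return {"xz": xz_shape, "time": time_shape, "conditions": conditions_shape}
```

Building with the wrong branch here would create weight shapes that do not match the tensors passed at call time, which is the kind of downstream problem the comment above warns about.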

@arrjon
Member Author

arrjon commented Jul 22, 2025

I corrected the shapes in the build functions. Please verify the implementation: the tests passed locally, but I am not too confident in it.

@arrjon
Member Author

arrjon commented Jul 23, 2025

@vpratz now the tests pass as well :)

@vpratz
Collaborator

vpratz commented Jul 23, 2025

Great, I'll take a final look tomorrow and then merge it.

vpratz added 2 commits July 24, 2025 06:36
The convention is to use parameter name with a `_shape` suffix.
The cost of the continuous models on the CI is too high
@vpratz
Collaborator

vpratz commented Jul 24, 2025

I have changed the naming of the shapes and created a reduced test for this setting, as the inference network tests are really slow.

@vpratz vpratz merged commit d9e9782 into dev Jul 24, 2025
9 checks passed
@vpratz vpratz deleted the diffusion-model-conditioning branch July 24, 2025 09:14
@arrjon
Member Author

arrjon commented Jul 24, 2025

Thanks @vpratz!

3 participants